RAW SEQUENCE LISTING DATE: 06/13/2000 

PATENT APPLICATION US/09/225, 080 TIME: 03 :00: 50 

INPUT SET: S3 5618. raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



ENTERED 



SEQUENCE LISJ 

General Inf ormat ion : 
(i) APPLICANT: Au-Young, Janice 

(ii) TITLE OF INVENTION: NOVEL HUMAN STEM CELL ANTIGENS 

(iii) NUMBER OF SEQUENCES: 26 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Drive 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: U.S. 

(F) ZIP: 94304 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/225,080 

(B) FILING DATE: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US/08/6 75,508 

(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE/DOCKET NUMBER: PF-0066 US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-855-0555 

(B) TELEFAX: 415-845-4166 

(2) INFORMATION FOR SEQ ID N0:1: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 



PAGE: 2 RAW SEQUENCE LISTING DATE: 06/13/2000 

PATENT APPLICATION US/09/225,080 TIME: 03:00:51 

INPUT SET: S35618.raw 

47 (C) STRANDEDNESS : single 

48 (D) TOPOLOGY,: linear 
49 

5 0 (ii) MOLECULE TYPE: peptide 

51 

52 (vii) IMMEDIATE SOURCE: 

53 (A) LIBRARY: SCAH-1 

54 (B) CLONE: 
55 

56 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

57 

58 Met Lys lie Phe Leu Pro Val Leu Leu Ala Ala Leu Leu Gly Val Glu 

59 1 5 10 15 

60 Arg Ala Ser Ser Leu Met Cys Phe Ser Cys Leu Asn Gin Lys Ser Asn 

61 20 25 30 

62 Leu Tyr Cys Leu Lys Pro Thr lie Cys Ser Asp Gin Asp Asn Tyr Cys 

63 35 40 45 

64 Val Thr Val Ser Ala Ser Ala Gly He Gly Asn Leu Val Thr Phe Gly 

65 50 55 60 

66 His Ser Leu Ser Lys Thr Cys Ser Pro Ala Cys Pro He Pro Glu Gly 

67 65 70 75 80 

68 Val Asn Val Gly Val Ala Ser Met Gly He Ser Cys Cys Gin Ser Phe 

69 85 90 95 

70 Leu Cys Asn Phe Ser Ala Ala Asp Gly Gly Leu Arg Ala Ser Val Thr 

71 100 105 110 

72 Leu Leu Gly Ala Gly Leu Leu Leu Ser Leu "Xaa Pro Ala Leu Leu Arg 

73 115 120 125 

74 Phe Gly Pro 

75 130 
76 

77 (2) INFORMATION FOR SEQ ID NO: 2: 

78 

7 9 (i) SEQUENCE CHARACTERISTICS: 

80 (A) LENGTH: 123 amino acids 

81 (B) TYPE: amino acid 

82 (C) STRANDEDNESS: single 

83 (D) TOPOLOGY: linear 
84 

85 (ii) MOLECULE TYPE: peptide 

86 

87 (vii) IMMEDIATE SOURCE: 

88 (A) LIBRARY: SCAH-2 

89 (B) CLONE: 
90 

91 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

92 

93 Met Lys Ala Val Leu Leu Ala Leu Leu Met Ala Gly Leu Ala Leu Gin 

94 1 5 10 15 

95 Pro Gly Thr Ala Leu Leu Cys Tyr Ser Cys Lys Ala Gin Val Ser Asn 

96 20 25 30 

97 Glu Asp Cys Leu Gin Val Glu Asn Cys Thr Gin Leu Gly Glu Gin Cys 

98 35 40 45 

99 Trp Thr Ala Arg He Arg Ala Val Gly Leu Leu Thr Val He Ser Lys 




PAGE: 3 RAW SEQUENCE LISTING DATE: 06/13/2000 

PATENT APPLICATION US/09/225,080 TIME: 03:00:52 

INPUT SET: S3 561 8. raw 

100 50 55 60 

101 Gly Cys Ser Leu Asn Cys Val Asp Asp Ser Gin Asp Tyr Tyr Val Gly 

102 65 70 75 ^ — 80 

103 Lys Lys Asn lie Thr Cys Cys Asp Thr Asp Leu Cys Asn Xaa ]ser Gly . 

104 85 90 V^^-:/95 

105 Ala His Ala Leu Gin Pro Ala Ala Ala lie Leu Ala Leu Leu Pro'-Ala 

106 100 105 110 

107 Leu Gly Leu Leu Leu Trp Gly Pro Gly Gin Leu 

108 115 120 
109 

110 (2) INFORiVIATIGN FOR SEQ ID NO: 3: 

111 

112 (i) SEQUENCE CHARACTERISTICS: 

113 (A) LENGTH: 537 base pairs 

114 (B) TYPE: nucleic acid 

115 (C) STRANDEDNESS: single 

116 (D) TOPOLOGY: linear 
117 

118 (ii) MOLECULE TYPE: cDNA 

119 

12 0 (vii) IMMEDIATE SOURCE: 

121 (A) LIBRARY: SCAH-1 

122 (B) CLONE: 
123 

124 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

125 . 

126 NAGGANGGTG GGGGACCCAN GGGTCCAGAG CGCAGTTCGG GTCGGAGCTY CGWCCAGGCT 60 

127 GCTGGTACCT GCGTCCGCCC GGCGAGCAGG ACAGGCTGCT TTGGTTTGTG ACCTCCAGGC 120' 
12 8 AGGACGGCCA TCCTCTCCAG AATGAAGATC' TTCTTGCCAG TGCTGCTGGC TGCCCTTCTG 180 

12 9 GGTGTGGAGC GAGCCAGCTC GCTGATGTGC TTCTCCTGCT TGAACCAGAA GAGCAATCTG 24 0 

13 0 TACTGCCTGA AGCCGACCAT CTGCTCCGAC CAGGACAACT ACTGCGTGAC TGTGTCTGCT 300 

131 AGTGCCGGCA TTGGGAATCT CGTGACATTT GGCCACAGCC TGAGCAAGAC CTGTTCCCCG 360 

132 GCCTGCCCCA TCCCAGAAGG CGTCAATGTT GGTGTGGCTT CCATGGGCAT CAGCTGCTGC 42 0 

133 CAGAGCTTTC TGTGCAATTT CAGTGCGGCC GATGGCGGGC TGCGGGCAAG CGTCACCCTG 48 0 

134 CTGGGTGCCG GGCTGCTGCT GAGCCTGWTG CCGGCCCTGC TGCGGTTTGG CCCCTGA 537 
135 

136 (2) INFORMATION FOR SEQ ID NO: 4: 

137 

138 (i) SEQUENCE CHARACTERISTICS: 

139 (A) LENGTH: 494 base pairs 

140 (B) TYPE: nucleic acid 

141 (C) STRANDEDNESS: single 

142 (D) TOPOLOGY: linear 
143 

144 (ii) MOLECULE TYPE: cDNA 

145 

146 (vii) IMMEDIATE SOURCE: 

147 (A) LIBRARY: SCAH-2 

148 (B) CLONE: 
149 

150 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

151 

152 GTGACCATGA AGGCTGTGCT GCTTGCCCTG TTGATGGCAG GCTTGGCCCT GCAGCCAGGC -60 
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PATENT APPLICATION US/09/225,080 TIME: 03:00:53 

INPUT SET: S35618.raw 

153 ACTGCCCTGC TGTGCTACTC CTGCAAAGCC CAGGTGAGCA ACGAGGACTG CCTGCAGGTG 12 0 

154 GAGAACTGCA CCCAGCTGGG GGAGCAGTGC TGGACCGCGC GCATCCGCGC AGTTGGCCTC 180 

155 CTGACCGTCA TCAGCAAAGG CTGCAGCTTG AACTGCGTGG ATGACTCACA GGACTACTAC 24 0 

156 GTGGGCAAGA AGAACATCAC GTGCTGTGAC ACCGACTTGT GCAACGSCAG CGGGGCCCAT 300 

157 GCCCTGCAGC CGGCTGCCGC CATCCTTGCG CTGCTCCCTG CACTCGGCCT GCTGCTCTGG 360 

158 GGACCCGGCC AGCTATAGGC TCTGGGGGGC CCCGMTGCAG CCCACACTGG GTGTGGTGCC 42 0 

159 CCAAGGCCTC TGTGSCACTC CTMACAGACC TGGGCCCAGT GGGAGSCTGT CTCTNGGTTC 48 0 
16 0 CTGAGGCACA TCCT 4 94 
161 

162 (2) INFORMATION FOR SEQ ID NO: 5: 
163 

164 (i) SEQUENCE CHARACTERISTICS: 

165 (A) LENGTH: 136 amino acids 

166 (B) TYPE: amino acid 

167 , (C) STRANDEDNESS : single 

168 (D) TOPOLOGY: linear 
169 

170 (ii) MOLECULE TYPE: peptide 
171 

172 (vii) IMMEDIATE SOURCE: 

173 • (A) LIBRARY: GenBank 

174 (B) CLONE: 434660 
175 

176 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
177 

178 Met Ser Ala Thr Ser Asn Met Arg Val Phe Leu Pro Val Leu Leu Ala 

179 15 10 15 

180 Ala Leu Leu Gly Met Glu Gin Val His Ser Leu Met Cys Phe Ser Cys 

181 20 25 30 

182 Thr Asp Gin Lys Asn Asn lie Asn Cys Leu Trp Pro Val Ser Cys Gin 

183 35 40 45 

184 Glu Lys Asp His Tyr Cys lie Thr Leu Ser Ala Ala Ala Gly Phe Gly 

185 50 55 60 

186 Asn Val Asn Leu Gly Tyr Thr Leu Asn Lys Gly Cys Ser Pro lie Cys 

187 65 70 75 80 

188 Pro Ser Glu Asn Val Asn Leu Asn Leu Gly Val Ala Ser Val Asn Ser 

189 85 90 95 

190 Tyr Cys Cys Gin Ser Ser Phe Cys Asn Phe Ser Ala Ala Gly Leu Gly 

191 100 105 110 

192 Leu Arg Ala Ser lie Pro Leu Leu Gly Leu Gly Leu Leu Leu Ser Leu 

193 115 120 125 

194 Leu Ala Leu Leu Gin Leu Ser Pro 

195 130 135 ^ 
196 

197 (2) INFORMATION FOR SEQ ID NO : 6 : 
198 

199 (i) SEQUENCE CHARACTERISTICS: 

200 (A) LENGTH: 109 amino acids 

201 (B) TYPE: amino acid 

202 (C) STRANDEDNESS: single 

203 (D) TOPOLOGY: linear 
204 

2 05 (ii) MOLECULE TYPE: peptide 
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PATENT APPLICATION US/09/225, 080 TIME: 03:00:54 

INPUT SET: S35618,raw 

206 

207 (vii) IMMEDIATE SOURCE: 

208 (A) LIBRARY: GenBank 

209 (B) CLONE: 1199651 
210 

211 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
212 

213 Met Ser Thr Thr Ser Ser Met Arg Val Phe Ser lie Val Leu Gin Ala 

214 15 10 15 

215 His Leu Leu Gly Val Glu Leu Val Pro Ser Leu lie Cys Ser Ser Cys 

216 20 25 30 

217 Thr His Gin Lys Ser Asn lie Asn Pro Pro Trp Pro Val Ala Cys Lys 

218 35 40 45 

219 Asp Thr Gly Asn Tyr Cys lie Met Leu Phe Ser Ala Val Gly Phe Gly 

220 50 * 55 60 

221 Asn Val Asn Leu Gly Tyr Thr Leu Asn Thr Gly Cys Ser Gin Ser Cys 

222 65 70 75 80 

223 Pro His Glu Asn lie Asn lie Asn Pro Gly. Val Ala Ser Val Asn Ser 

224 85 90 95 

225 Tyr Gin Ser Ser Phe Cys Asn Phe Ser Asn Ala Cys Leu 

226 100 105 
227 

228 (2) INFORMATION FOR SEQ ID NO: 7: 
229 

23 0 (i) SEQUENCE CHARACTERISTICS: 

231 (A) LENGTH: 148 base pairs 

232 (B) TYPE: nucleic acid 

233 (C) STRANDEDNESS : single 

234 (D) TOPOLOGY: linear 
235 

236 (ii) MOLECULE TYPE: cDNA 
237 

238 (vii) IMMEDIATE SOURCE: 

239 (A) LIBRARY: THPlPEBOl 

240 (B) CLONE: 72518 
241 

242 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
243 

244 CCTGAAGCCG ACCATCTGNT CCGACCAGGN CAACTACTGN GTGACTGTGT CTGCTAGTGC 60 

24 5 CGGCATTTGG AATCTTGTGC CATTTNGACA CAGCCTNAGC AAGACCTNTT CCCCGGCCTN 120 
24 6 NCCCATCCCA GAAGGNGTCA ATNATNGT 14 8 
247 

24 8 (2) INFORMATION FOR SEQ ID NO: 8: 
249 

250 (i) SEQUENCE CHARACTERISTICS: 

251 (A) LENGTH: 196 base pairs 

252 (B) TYPE: nucleic acid 

253 (C) STRANDEDNESS: single 

254 (D) TOPOLOGY: linear 
255 

256 (ii) MOLECULE TYPE: cDNA 
257 

258 (vii) IMMEDIATE SOURCE: 



SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/09/225,080 



DATE: 06/13/2000 
TIME: 03:00:55 



INPUT SET: S35618.raw 



Original Text 



